
    UniPROBE, update 2015: new tools and content for the online database of protein-binding microarray data on protein-DNA interactions

    The Universal PBM Resource for Oligonucleotide Binding Evaluation (UniPROBE) serves as a convenient source of information on published data generated using universal protein-binding microarray (PBM) technology, which provides in vitro data about the relative DNA-binding preferences of transcription factors for all possible sequence variants of length k (‘k-mers’). The database displays important information about the proteins and displays their DNA-binding specificity data in terms of k-mers, position weight matrices and graphical sequence logos. This update to the database documents the growth of UniPROBE since the last update 4 years ago, and introduces a variety of new features and tools, including a new streamlined pipeline that facilitates data deposition by universal PBM data generators in the research community, a tool that generates putative nonbinding (i.e. negative control) DNA sequences for one or more proteins and novel motifs obtained by analyzing the PBM data using the BEEML-PBM algorithm for motif inference. The UniPROBE database is available at http://uniprobe.org. National Institutes of Health (U.S.) (R01 HG003985); National Science Foundation (U.S.). Graduate Research Fellowship Program
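
As an illustration of what "all possible sequence variants of length k" means in the abstract above, the following minimal sketch enumerates every DNA k-mer over the alphabet A, C, G, T. The function name `all_kmers` is hypothetical and is not part of UniPROBE or any tool named in the abstract.

```python
from itertools import product


def all_kmers(k, alphabet="ACGT"):
    """Return every sequence of length k over the given alphabet.

    Hypothetical helper for illustration only; for DNA there are 4**k
    such sequences, listed here in lexicographic order.
    """
    if k < 1:
        raise ValueError("k must be a positive integer")
    return ["".join(letters) for letters in product(alphabet, repeat=k)]


if __name__ == "__main__":
    kmers = all_kmers(3)
    print(len(kmers))   # 64, i.e. 4**3
    print(kmers[:4])    # ['AAA', 'AAC', 'AAG', 'AAT']
```

The list grows as 4**k, so it stays small enough to hold in memory for the short k values typical of motif analysis.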

    Expression-Guided In Silico Evaluation of Candidate Cis Regulatory Codes for Drosophila Muscle Founder Cells

    While combinatorial models of transcriptional regulation can be inferred for metazoan systems from a priori biological knowledge, validation requires extensive and time-consuming experimental work. Thus, there is a need for computational methods that can evaluate hypothesized cis regulatory codes before the difficult task of experimental verification is undertaken. We have developed a novel computational framework (termed “CodeFinder”) that integrates transcription factor binding site and gene expression information to evaluate whether a hypothesized transcriptional regulatory model (TRM; i.e., a set of co-regulating transcription factors) is likely to target a given set of co-expressed genes. Our basic approach is to simultaneously predict cis regulatory modules (CRMs) associated with a given gene set and quantify the enrichment for combinatorial subsets of transcription factor binding site motifs comprising the hypothesized TRM within these predicted CRMs. As a model system, we have examined a TRM experimentally demonstrated to drive the expression of two genes in a sub-population of cells in the developing Drosophila mesoderm, the somatic muscle founder cells. This TRM was previously hypothesized to be a general mode of regulation for genes expressed in this cell population. In contrast, the present analyses suggest that a modified form of this cis regulatory code applies to only a subset of founder cell genes, those whose gene expression responds to specific genetic perturbations in a similar manner to the gene on which the original model was based. We have confirmed this hypothesis by experimentally discovering six (out of 12 tested) new CRMs driving expression in the embryonic mesoderm, four of which drive expression in founder cells.

    An Integrated Strategy for Analyzing the Unique Developmental Programs of Different Myoblast Subtypes

    An important but largely unmet challenge in understanding the mechanisms that govern the formation of specific organs is to decipher the complex and dynamic genetic programs exhibited by the diversity of cell types within the tissue of interest. Here, we use an integrated genetic, genomic, and computational strategy to comprehensively determine the molecular identities of distinct myoblast subpopulations within the Drosophila embryonic mesoderm at the time that cell fates are initially specified. A compendium of gene expression profiles was generated for primary mesodermal cells purified by flow cytometry from appropriately staged wild-type embryos and from 12 genotypes in which myogenesis was selectively and predictably perturbed. A statistical meta-analysis of these pooled datasets, based on expected trends in gene expression and on the relative contribution of each genotype to the detection of known muscle genes, provisionally assigned hundreds of differentially expressed genes to particular myoblast subtypes. Whole embryo in situ hybridizations were then used to validate the majority of these predictions, thereby enabling true-positive detection rates to be estimated for the microarray data. This combined analysis reveals that myoblasts exhibit much greater gene expression heterogeneity and overall complexity than was previously appreciated. Moreover, it implicates the involvement of large numbers of uncharacterized, differentially expressed genes in myogenic specification and subsequent morphogenesis. These findings also underscore a requirement for considerable regulatory specificity for generating diverse myoblast identities. Finally, to illustrate how the developmental functions of newly identified myoblast genes can be efficiently surveyed, a rapid RNA interference assay that can be scored in living embryos was developed and applied to selected genes. This integrated strategy for examining embryonic gene expression and function provides a substantially expanded framework for further studies of this model developmental system.

    Survey of variation in human transcription factors reveals prevalent DNA binding changes

    Published in final edited form as: Science. 2016 Mar 25; 351(6280): 1450–1454. Published online 2016 Mar 24. doi: 10.1126/science.aad2257. Sequencing of exomes and genomes has revealed abundant genetic variation affecting the coding sequences of human transcription factors (TFs), but the consequences of such variation remain largely unexplored. We developed a computational, structure-based approach to evaluate TF variants for their impact on DNA binding activity and used universal protein-binding microarrays to assay sequence-specific DNA binding activity across 41 reference and 117 variant alleles found in individuals of diverse ancestries and families with Mendelian diseases. We found 77 variants in 28 genes that affect DNA binding affinity or specificity and identified thousands of rare alleles likely to alter the DNA binding activity of human sequence-specific TFs. Our results suggest that most individuals have unique repertoires of TF DNA binding activities, which may contribute to phenotypic variation. National Institutes of Health; NHGRI R01 HG003985; P50 HG004233; A*STAR National Science Scholarship; National Science Foundation

    Using a structural and logics systems approach to infer bHLH–DNA binding specificity determinants

    Numerous efforts are underway to determine gene regulatory networks that describe physical relationships between transcription factors (TFs) and their target DNA sequences. Members of paralogous TF families typically recognize similar DNA sequences. Knowledge of the molecular determinants of protein–DNA recognition by paralogous TFs is of central importance for understanding how small differences in DNA specificities can dictate target gene selection. Previously, we determined the in vitro DNA binding specificities of 19 Caenorhabditis elegans basic helix-loop-helix (bHLH) dimers using protein binding microarrays. These TFs bind E-box (CANNTG) and E-box-like sequences. Here, we combine these data with logics, bHLH–DNA co-crystal structures and computational modeling to infer which bHLH monomer can interact with which CAN E-box half-site and we identify a critical residue in the protein that dictates this specificity. Validation experiments using mutant bHLH proteins provide support for our inferences. Our study provides insights into the mechanisms of DNA recognition by bHLH dimers as well as a blueprint for system-level studies of the DNA binding determinants of other TF families in different model organisms and humans. National Institute of General Medical Sciences (U.S.) (DK068429); National Institute of General Medical Sciences (U.S.) (HG003985); European Union (PROSPECTS HEALTH-F4-2008-201648)
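
To make the E-box consensus (CANNTG) mentioned in the abstract above concrete, the sketch below scans a DNA string for that pattern, where N is any of A, C, G or T. The function name `find_eboxes` and the example sequences are hypothetical; this is plain pattern matching and not the structural or logic-based inference the paper describes.

```python
import re

# CANNTG with N = any base. The lookahead lets overlapping sites be
# reported, since the regex engine does not consume the matched bases.
_EBOX = re.compile(r"(?=(CA[ACGT]{2}TG))")


def find_eboxes(sequence):
    """Return (start, site) pairs for every CANNTG match, 0-based.

    Hypothetical helper for illustration; input is upper-cased first,
    and only the given strand is scanned.
    """
    seq = sequence.upper()
    return [(m.start(), m.group(1)) for m in _EBOX.finditer(seq)]


if __name__ == "__main__":
    print(find_eboxes("aaCACGTGtt"))  # [(2, 'CACGTG')]
    print(find_eboxes("CACATGTG"))    # [(0, 'CACATG'), (2, 'CATGTG')]
```

Each half of the site (CAN and its partner) is contacted by one monomer of the dimer according to the abstract, which is why the two central bases are the variable positions in the pattern.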

    Contribution of Distinct Homeodomain DNA Binding Specificities to Drosophila Embryonic Mesodermal Cell-Specific Gene Expression Programs

    Homeodomain (HD) proteins are a large family of evolutionarily conserved transcription factors (TFs) having diverse developmental functions, often acting within the same cell types, yet many members of this family paradoxically recognize similar DNA sequences. Thus, with multiple family members having the potential to recognize the same DNA sequences in cis-regulatory elements, it is difficult to ascertain the role of an individual HD or a subclass of HDs in mediating a particular developmental function. To investigate this problem, we focused our studies on the Drosophila embryonic mesoderm where HD TFs are required to establish not only segmental identities (such as the Hox TFs), but also tissue and cell fate specification and differentiation (such as the NK-2 HDs, Six HDs and identity HDs (I-HDs)). Here we utilized the complete spectrum of DNA binding specificities determined by protein binding microarrays (PBMs) for a diverse collection of HDs to modify the nucleotide sequences of numerous mesodermal enhancers to be recognized by either no or a single subclass of HDs, and subsequently assayed the consequences of these changes on enhancer function in transgenic reporter assays. These studies show that individual mesodermal enhancers receive separate transcriptional input from both I-HD and Hox subclasses of HDs. In addition, we demonstrate that enhancers regulating upstream components of the mesodermal regulatory network are targeted by the Six class of HDs. Finally, we establish the necessity of NK-2 HD binding sequences to activate gene expression in multiple mesodermal tissues, supporting a potential role for the NK-2 HD TF Tinman (Tin) as a pioneer factor that cooperates with other factors to regulate cell-specific gene expression programs. Collectively, these results underscore the critical role played by HDs of multiple subclasses in inducing the unique genetic programs of individual mesodermal cells, and in coordinating the gene regulatory networks directing mesoderm development. National Institutes of Health (U.S.) (Grant R01 HG005287)

    The transmembrane protein Perdido interacts with Grip and integrins to mediate myotube projection and attachment in the Drosophila embryo

    The molecular mechanisms underlying muscle guidance and formation of myotendinous junctions are poorly understood both in vertebrates and in Drosophila. We have identified a novel gene that is essential for Drosophila embryonic muscles to form proper projections and stable attachments to epidermal tendon cells. Loss-of-function of this gene - which we named perdido (perd) - results in rounded, unattached muscles. perd is expressed prior to myoblast fusion in a subset of muscle founder cells, and it encodes a conserved single-pass transmembrane cell adhesion protein that contains laminin globular extracellular domains and a small intracellular domain with a C-terminal PDZ-binding consensus sequence. Biochemical experiments revealed that the Perd intracellular domain interacts directly with one of the PDZ domains of the Glutamate receptor interacting protein (Grip), another factor required for formation of proper muscle projections. In addition, Perd is necessary to localize Grip to the plasma membrane of developing myofibers. Using a newly developed, whole-embryo RNA interference assay to analyze genetic interactions, perd was shown to interact not only with Grip but also with multiple edematous wings, which encodes one subunit of the α-PS1-βPS integrin expressed in tendon cells. These experiments uncovered a previously unrecognized role for the αPS1-βPS integrin in the formation of muscle projections during early stages of myotendinous junction development. We propose that Perd regulates projection of myotube processes toward and subsequent differentiation of the myotendinous junction by priming formation of a protein complex through its intracellular interaction with Grip and its transient engagement with the tendon cell-expressed laminin-binding αPS1-βPS integrin. B.E. was funded in part by the Programa Ramon y Cajal from the Spanish Ministry of Education

    Reciprocal regulatory interactions between the Notch and Ras signaling pathways in the Drosophila embryonic mesoderm

    Convergent intercellular signals must be precisely integrated in order to elicit specific biological responses. During specification of muscle and cardiac progenitors from clusters of equivalent cells in the Drosophila embryonic mesoderm, the Ras/MAPK pathway, activated by both epidermal and fibroblast growth factor receptors, functions as an inductive cellular determination signal, while lateral inhibition mediated by Notch antagonizes this activity. A critical balance between these signals must be achieved to enable one cell of an equivalence group to segregate as a progenitor while its neighbors assume a nonprogenitor identity. We have investigated whether these opposing signals directly interact with each other, and we have examined how they are integrated by the responding cells to specify their unique fates. Our findings reveal that Ras and Notch do not function independently; rather, we have uncovered several modes of cross-talk between these pathways. Ras induces Notch, its ligand Delta, and the epidermal growth factor receptor antagonist, Argos. We show that Delta and Argos then synergize to nonautonomously block a positive autoregulatory feedback loop that amplifies a fate-inducing Ras signal. This feedback loop is characterized by Ras-mediated upregulation of proximal components of both the epidermal and fibroblast growth factor receptor pathways. In turn, Notch activation in nonprogenitors induces its own expression and simultaneously suppresses both Delta and Argos levels, thereby reinforcing a unidirectional inhibitory response. These reciprocal interactions combine to generate the signal thresholds that are essential for proper specification of progenitors and nonprogenitors from groups of initially equivalent cells. M.K.B. is supported by the Society of Memorial Sloan-Kettering Cancer Center and by National Institutes of Health Grant GM 56989. Support for A.C. is from an EMBO Postdoctoral Fellowship and a Human Frontiers Postdoctoral Fellowship. M.S.H. is an American Cancer Society Postdoctoral Fellow. A.M.M. is an Associate Investigator of the Howard Hughes Medical Institute. Peer reviewed

    Ras Pathway Specificity Is Determined by the Integration of Multiple Signal-Activated and Tissue-Restricted Transcription Factors

    Ras signaling elicits diverse outputs, yet how Ras specificity is generated remains incompletely understood. We demonstrate that Wingless (Wg) and Decapentaplegic (Dpp) confer competence for receptor tyrosine kinase–mediated induction of a subset of Drosophila muscle and cardiac progenitors by acting both upstream of and in parallel to Ras. In addition to regulating the expression of proximal Ras pathway components, Wg and Dpp coordinate the direct effects of three signal-activated (dTCF, Mad, and Pointed, functioning in the Wg, Dpp, and Ras/MAPK pathways, respectively) and two tissue-restricted (Twist and Tinman) transcription factors on a progenitor identity gene enhancer. The integration of Pointed with the combinatorial effects of dTCF, Mad, Twist, and Tinman determines inductive Ras signaling specificity in muscle and heart development. M. K. B. is supported by the Society of Memorial Sloan-Kettering Cancer Center and by National Institutes of Health grant GM 56989. Support for A. C. is from an EMBO Postdoctoral Fellowship and a Human Frontiers Postdoctoral Fellowship. M. S. H. is an American Cancer Society Postdoctoral Fellow. A. M. M. is an Associate Investigator of the Howard Hughes Medical Institute. Peer reviewed